An Efficient Architecture for Information Retrieval in P2P Context Using Hypergraph
Abstract
Peer-to-peer (P2P) data-sharing systems now generate a significant portion of Internet traffic and have emerged as an accepted way to share enormous volumes of data. The need for widely distributed information systems supporting virtual organizations has given rise to a new category of P2P systems called schema-based. In such systems each peer is a database management system in itself, exposing its own schema. In this setting, the main objective is efficient search across peer databases, processing each incoming query without consuming excessive bandwidth. The usability of these systems depends on successful techniques to find and retrieve data; however, efficient and effective routing of content-based queries is an emerging problem in P2P networks. This work is intended as an attempt to motivate the use of mining algorithms in the P2P context, which may significantly improve the efficiency of such methods. Our proposed method is based on a combination of clustering with hypergraphs. We use ECCLAT to build an approximate clustering and discover meaningful clusters with slight overlap. We use the MTMINER algorithm to extract all minimal transversals of a hypergraph (the clusters) for query routing. The set of clusters improves the robustness of the query routing mechanism and the scalability of the P2P network. We compare the performance of our method with a baseline on the query routing problem. Our experimental results show that the proposed methods achieve strong performance and scalability with respect to important criteria such as response time, precision, and recall.
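To make the notion of a minimal transversal concrete, the following is a minimal Python sketch, not the MTMINER algorithm itself. It assumes, purely for illustration, that each cluster is a hyperedge over peer identifiers; the function name `minimal_transversals` and the peer labels `p1` to `p4` are hypothetical. A transversal is a set of vertices that intersects every hyperedge, and it is minimal if no proper subset is also a transversal. The sketch enumerates candidates by brute force, which is exponential and only suitable for tiny examples.

```python
from itertools import combinations


def minimal_transversals(hyperedges):
    """Return all minimal transversals (minimal hitting sets) of a hypergraph.

    Brute-force illustration of the definition only; this is NOT MTMINER.
    Each hyperedge is an iterable of hashable, sortable vertex labels.
    """
    edges = [frozenset(e) for e in hyperedges]
    vertices = sorted(set().union(*edges)) if edges else []
    found = []
    # Enumerate candidates by increasing size, so every transversal that is
    # found before any of its supersets is examined.
    for size in range(1, len(vertices) + 1):
        for candidate in combinations(vertices, size):
            cand = frozenset(candidate)
            # A superset of a known transversal cannot be minimal.
            if any(t <= cand for t in found):
                continue
            # A transversal must intersect every hyperedge.
            if all(cand & e for e in edges):
                found.append(cand)
    return found


# Hypothetical clusters of peers, each treated as one hyperedge.
clusters = [{"p1", "p2"}, {"p2", "p3"}, {"p3", "p4"}]
for t in minimal_transversals(clusters):
    print(sorted(t))
```

Under the assumption above, each minimal transversal is a smallest-possible set of peers that touches every cluster, which is why such sets are natural candidates for query routing targets.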
Similar resources
Context-based Information seeking behavior among students of Kharazmi University
Background and Aim: The present study surveys contextualized information retrieval behavior among the students of Kharazmi University. Methods: This is descriptive applied research. The statistical population includes all students studying at Kharazmi University at the time of the research. The research sample includes 196 students selected by convenience sampling...
Cooperating Peers for Content-Oriented XML-Retrieval
Semi-structured documents formatted with the Extensible Markup Language (XML) are gaining wide use in a whole range of applications including E-Commerce, E-Business, E-Science, Digital Libraries (DL), File Sharing, and, in recent years, especially in applications for Peer-to-Peer (P2P) systems. P2P architectures have been identified as an efficient means of ad-hoc collaboration and information s...
Intelligent Content-Based Retrieval for P2P Networks
Currently, most peer-to-peer (P2P) systems are designed for file sharing by network participants. A simple metadata search mechanism is sufficient to support searching and retrieving shared files over P2P networks. However, to share document information such as news articles, scientific publications, company reports, etc., a content-based search mechanism is needed to provide efficient cont...
A Model for Decentralized Information Dissemination
The peer-to-peer computing paradigm may provide a solution to the retrieval problem in an ever-burgeoning volume of online and digital information. While research has focused on the means of collaboration as a tool for query routing, we feel that there is a disconnect between the way P2P networks are handled and the expectations of performance in the real world. In the proposed work, we discuss the need...
Peer-to-Peer Keyword Search Using Keyword Relationship
Decentralized and unstructured peer-to-peer (P2P) networks such as Gnutella are attractive for Internet-scale information retrieval and search systems because they require neither any centralized directory nor any centralized management of overlay network topology and data placement. However, due to this decentralized architecture, current P2P keyword search systems lack useful global knowledge...
Journal title:
- CoRR
Volume abs/1108.1378
Publication date 2011